Korpus: est_news_2011_100K

Weitere Korpora

3.6.2 Zipf's law for words of fixed lengths

Zipf distribution of words of fixed length 4, 6, 8, ..., 14.


Zipf's diagram for words of fixed length


Gnuplot diagram

Top Words of length 4
word rank frequency word
1 6578 ning
2 4976 siis
3 4119 pole
4 3494 seda
5 3036 kuid
Top Words of length 6
word rank frequency word
1 1810 pärast
2 1686 aastal
3 1595 midagi
4 1519 rohkem
5 1471 tagasi
Top Words of length 8
word rank frequency word
1 1543 Tallinna
2 886 inimesed
3 869 võimalik
4 777 lihtsalt
5 531 vähemalt
Top Words of length 10
word rank frequency word
1 499 tegelikult
2 316 tähelepanu
3 237 presidendi
4 159 Kuressaare
5 159 inimestele
Top Words of length 12
word rank frequency word
1 226 avaldamiseks
2 192 tõenäoliselt
3 191 Keskerakonna
4 83 kommenteeris
5 82 esmakordselt
Top Words of length 14
word rank frequency word
1 105 pressiesindaja
2 90 rahvusvahelise
3 79 Reformierakond
4 68 Eesti Ekspress
5 65 linnavalitsuse
Slope for length 4
Slope
-1.1037498616536525
Slope for length 6
Slope
-0.7785668433161497
Slope for length 8
Slope
-0.6483325951307655
Slope for length 10
Slope
-0.5586356478278822
Slope for length 12
Slope
-0.5246090113350907
Slope for length 14
Slope
-0.7196663469151313
574 msec needed at 2018-02-27 06:03